Joint Maximum Margin and Maximum Entropy Learning of Graphical Models
نویسنده
چکیده
INFERRING structured predictions based on correlated covariates remains a central problem in many fields, including NLP, computer vision, and computational biology. Typically, both the input covariates and output predictions can be high-dimensional, multi-modal, noisy, partially observable, and bearing latent structures, each of these characteristics adds a degree of complexity to the task of learning structured input/output (I/O) models. Several recent approaches to structured I/O learning are based on learning discriminative probabilistic graphical models. By defining composite features that explicitly exploit the structured dependencies among input elements (e.g., words in a sentence) and among the interpretational outputs (e.g., part-of-speech tags), such models can produce semantically consistent predictions from complex inputs. However, how to train such models properly remains a highly contested issue. The two dominant paradigms for training such models are the maximum (conditional) likelihood estimation (MLE) [6], which leads to the well-known CRF, and the max-margin learning [9], [10], which leads to the MN. While both methods have enjoyed remarkable success and are widely used, they have a number of deficiencies, as we will discuss below. In this project, we introduce a new paradigm for learning structured I/O models, and graphical models in general, that conjoins and extends the merits of MLE and maxmargin learning while avoiding their shortcomings.
منابع مشابه
Maximum Entropy Discrimination Markov Networks
Standard maximum margin structured prediction methods lack a straightforward probabilistic interpretation of the learning scheme and the prediction rule. Therefore its unique advantages such as dual sparseness and kernel tricks cannot be easily conjoined with the merits of a probabilistic model such as Bayesian regularization, model averaging, and ability to model hidden variables. In this pape...
متن کاملConditional and joint models for grapheme-to-phoneme conversion
In this work, we introduce several models for grapheme-tophoneme conversion: a conditional maximum entropy model, a joint maximum entropy n-gram model, and a joint maximum entropy n-gram model with syllabification. We examine the relative merits of conditional and joint models for this task, and find that joint models have many advantages. We show that the performance of our best model, the joi...
متن کاملPartially Observed Maximum Entropy Discrimination Markov Networks
Learning graphical models with hidden variables can offer semantic insights to complex data and lead to salient structured predictors without relying on expensive, sometime unattainable fully annotated training data. While likelihood-based methods have been extensively explored, to our knowledge, learning structured prediction models with latent variables based on the max-margin principle remai...
متن کاملThe Integration of Dependency Relation Classification and Semantic Role Labeling Using Bilayer Maximum Entropy Markov Models
This paper describes a system to solve the joint learning of syntactic and semantic dependencies. An directed graphical model is put forward to integrate dependency relation classification and semantic role labeling. We present a bilayer directed graph to express probabilistic relationships between syntactic and semantic relations. Maximum Entropy Markov Models are implemented to estimate condi...
متن کامل22 : Hilbert Space Embeddings of Distributions Lecturer : Eric
The application of classical optimization techniques to Graphical Models has led to specialized derivations of powerful paradigms such as the class of EM algorithms, variational inference, max-margin and maximum entropy learning. This view has also sustained a conceptual bridge between the research communities of Graphical Models, Statistical Physics and Numerical Optimization. The optimization...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010